🐿️ Scour
🧠 unified memory

gpu localized ai

Why I Ditched Malloc for AI Inference
gilli.dev·3d·
Discuss: Hacker News
🦀Rust
Huawei preps AI SSD to ease GPU memory bottlenecks
blocksandfiles.com·5d
🏢oxide computer
Hardware Technologies And Algorithms for Vector Symbolic Architectures (Purdue Univ., Georgia Tech)
semiengineering.com·6d
🦀Rust
When it comes to running Ollama on your PC for local AI, one thing matters more than most — here's why
windowscentral.com·6d
🏢oxide computer
MSNav: Zero-Shot Vision-and-Language Navigation with Dynamic Memory and LLM Spatial Reasoning
arxiv.org·6d
🤖agentic coding
AMD's Next-Gen UDNA: Four Die Sizes, One Potential 96-CU Flagship
techpowerup.com·5d·
Discuss: r/LocalLLaMA
🦀Rust
Designing AI factories: Purpose-built, on-prem GPU data centers
datasciencecentral.com·5d
💼ai-run businesses
Matrix Multiplication on Nvidia's Blackwell: Part 1 – Introduction
modular.com·2d·
Discuss: Hacker News
🦀Rust
LLM VRAM Usage Cut by 45x? What Jet-Nemotron Means for Local Users
hardware-corner.net·5d·
Discuss: Hacker News
🦀Rust
Building Mycelian Memory in Go: Long-Term Memory Framework for AI Agents
reddit.com·18h·
Discuss: r/golang
🦀Rust
NVIDIA details Blackwell Ultra GB300: dual-die design, 208B transistors, up to 288GB HBM3E
tweaktown.com·5d
🏢oxide computer
How AI Is Reshaping the Value of SSDs and DDR
dev.to·4d·
Discuss: DEV
💼ai-run businesses
VGG v GoogleNet: Just how deep can they go?
mayberay.bearblog.dev·2d
🤖agentic coding
Show HN: Paragon: A Go-native AI framework with WebGPU/Vulkan (no CUDA lock-in)
openfluke.com·6d·
Discuss: Hacker News
🦀Rust
Fast and Scalable Mixed Precision Euclidean Distance Calculations Using GPU Tensor Cores
arxiv.org·8h
🦀Rust
vLLM Performance Tuning: The Ultimate Guide to xPU Inference Configuration
cloud.google.com·6d
🏢oxide computer
Long Shot: augmenting COCONUT with a working memory
github.com·6d·
Discuss: r/LocalLLaMA
💼ai-run businesses
Why are CUDA kernels hard to optimize?
johndcook.com·4d·
Discuss: Hacker News
🦀Rust
Dynamic KV Cache Scheduling in Heterogeneous Memory Systems for LLM Inference (Rensselaer Polytechnic Institute, IBM)
semiengineering.com·3d
🦀Rust
Solving the compute crisis with physics-based ASICs
arxiviq.substack.com·19h·
Discuss: Substack
🏢oxide computer